Pybel: a Python wrapper for the OpenBabel cheminformatics toolkit

نویسندگان

  • Noel M O'Boyle
  • Chris Morley
  • Geoffrey R Hutchison
چکیده

BACKGROUND Scripting languages such as Python are ideally suited to common programming tasks in cheminformatics such as data analysis and parsing information from files. However, for reasons of efficiency, cheminformatics toolkits such as the OpenBabel toolkit are often implemented in compiled languages such as C++. We describe Pybel, a Python module that provides access to the OpenBabel toolkit. RESULTS Pybel wraps the direct toolkit bindings to simplify common tasks such as reading and writing molecular files and calculating fingerprints. Extensive use is made of Python iterators to simplify loops such as that over all the molecules in a file. A Pybel Molecule can be easily interconverted to an OpenBabel OBMol to access those methods or attributes not wrapped by Pybel. CONCLUSION Pybel allows cheminformaticians to rapidly develop Python scripts that manipulate chemical information. It is open source, available cross-platform, and offers the power of the OpenBabel toolkit to Python programmers.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Packaging free software chemistry programs in Debian GNU/Linux: past, present and future

Started in 1993, the Debian project is one of the oldest Free Software projects. Due to its volunteer nature, specialists from all fields contribute to the Debian GNU/ Linux distribution, which includes more than 30000 packages. The Debian packaging policy, its advanced package management system and the conservative release process lead to a stable basis which is ideal for customized environmen...

متن کامل

Cinfony – combining Open Source cheminformatics toolkits behind a common interface

BACKGROUND Open Source cheminformatics toolkits such as OpenBabel, the CDK and the RDKit share the same core functionality but support different sets of file formats and forcefields, and calculate different fingerprints and descriptors. Despite their complementary features, using these toolkits in the same program is difficult as they are implemented in different languages (C++ versus Java), ha...

متن کامل

Ushering the Cactvs Toolkit into the Python Age (without breaking the legacy)

The Cactvs Chemoinformatics Toolkit is probably the most powerful general-purpose chemical information processing toolkit on the market. Since its inception about twenty years ago, its main language for rapid script development has been Tcl – at that time a language at the forefront of lazily-typed rapid prototyping and interface programming language design. While Tcl is still actively maintain...

متن کامل

PyBEL: a computational framework for Biological Expression Language

Summary Biological Expression Language (BEL) assembles knowledge networks from biological relations across multiple modes and scales. Here, we present PyBEL; a software package for parsing, validating, converting, storing, querying, and visualizing networks encoded in BEL. Availability and implementation PyBEL is implemented in platform-independent, universal Python code. Its source is distri...

متن کامل

NTU-MC Toolkit: Annotating a Linguistically Diverse Corpus

The NTU-MC Toolkit is a compilation of tools to annotate the Nanyang Technological University Multilingual Corpus (NTU-MC). The NTU-MC is a parallel corpora of linguistically diverse languages (Arabic, English, Indonesian, Japanese, Korean, Mandarin Chinese, Thai and Vietnamese). The NTU-MC thrives on the mantra of "more data is better data and more annotation is better information". Other than...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Chemistry Central Journal

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2008